ROXXI: Reviving witness dOcuments to eXplore eXtracted Information
نویسندگان
چکیده
In recent years, there has been considerable research on information extraction and constructing RDF knowledge bases. In general, the goal is to extract all relevant information from a corpus of documents, store it into an ontology, and answer future queries based only on the created knowledge base. Thus, the original documents become dispensable. On the one hand, an ontology is a convenient and non-redundant structured source of information, based on which specific queries can be answered efficiently. On the other hand, many users doubt the correctness of facts and ontology subgraphs presented to them as query results without proof. Instead, users often wish to verify the obtained facts or subgraphs by reading about them in context, i.e., in a document relating the facts and providing background information. In this demo, we present ROXXI, a system operating on top of an existing knowledge base and reviving the abandoned witness documents. In doing so, it goes the opposite way of information extraction approaches – starting with ontological facts and tracing their way back to the documents they were extracted from. ROXXI offers interfaces for expert users (SPARQL) as well as for non-experts (ontology browser) and provides a ranked list of documents each associated with a content snippet highlighting the queried facts in context. At the demonstration site, we will show the advantages of this novel approach towards document retrieval and illustrate the benefits of reviving the documents that information extraction approaches neglect.
منابع مشابه
[Opening].
a. Witness Professor Reyntjens (PW77) asked permission to address the Chamber. First, he advised the Chamber that he had made an error in an answer he had given yesterday and offered further explanations. Second, he advised the Chamber that he had been given documents yesterday by Counsel for Bagosora (Mr. Constant), some of which were stamped “confidential” and which he recognized as being doc...
متن کاملInformation literacy in public libraries from the perspective of public libraries’ policymakers; an exploratory study
Purpose: The present paper aims to conduct an exploratory study on the status of information literacy in upstream documents and curriculums of Iran public libraries institutions for public libraries. Methodology: This is a developmental exploratory-qualitative study in terms of purpose. Research data were collected using in-depth, semi-structured interviews with policymakers and officials of p...
متن کاملXQuery adaptation for multimodal retrieval of multimedia documents
Recent years witness a phenomenal growth of multimedia data in various modalities, such as image, video, audio, and graphic, which poses a challenge of finding an efficient information retrieval technology. Rather than monotonous, single-modal information, users would like to have a multimodal system to query multimedia documents. In this paper, we present our propositions in two parts, the fir...
متن کاملBook Ii Chapter Xi De La Probabilité Des Témoignages
We have extracted a ball from an urn which contains n− 1 black balls and one white ball. A witness to the drawing announces that the extracted ball is white; we demand the probability of this exit. If the number n is very great, that which renders extraordinary the exit of the white ball, the probability of the error or of the falsehood of the witness becomes quite near to certitude, that which...
متن کاملبازخوانی اسناد کتیبهای غیرمنقول در میراث جهانی مجموعه بازار تاریخی تبریز
Immovable inscriptions are considered as one of the most important works and among the historical documents in cultural assets of our dear country, which were installed on selected parts of historical buildings and outstanding monuments and were always noticeable. The role of inscriptions as the basic and effective tools is important in terms of manifesting and implication of educational and ed...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- PVLDB
دوره 3 شماره
صفحات -
تاریخ انتشار 2010